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Abstract 

We study the properties of metrics aimed at the characterization 
of grid- like ordering in complex networks. These metrics are based 
on the global and local behavior of cycles of order four, which are 
the minimal structures able to identify rectangular clustering. The 
analysis of data from real networks reveals the ubiquitous presence of 
a high level of grid-like ordering that is non-trivially correlated with 
the local degree properties. These observations provide new insights 
on the hierarchical structure of complex networks. 

Many networks arising in social, biological, and technological contexts are 
growing and self-organizing systems which are not modeled by any supervis- 
ing entity, nor follow an externally denned blueprint Jl|, [|, ||, f|, || . Empirical 
evidences, indeed, have prompted that most of the times the network's topol- 
ogy exhibits complex features which cannot be explained by merely extrap- 
olating the local properties of their constituents. The most relevant among 
these features are the small-world property || [3| and a high level of hetero- 
geneity, usually reflected in a scale-free behavior of the network's connectivity 
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0. While these properties would prompt to a very large degree of random- 
ness, yet real networks exhibit a surprising level of structural ordering. This 
fact has been first pointed out by noting the common property of many net- 
works to form cliques in which every element is linked to every other element; 

1. e. the presence of a high clustering coefficient ||, defined as the fraction of 
triangles present in the network. The identification of hidden ordering and 
hierarchies in the seemingly haphazard appearance of real networks is there- 
fore a major area of study, aimed at understanding their basic organizing 
principles. This activity has led to a harvest of results concerning nontriv- 
ial correlation properties among the various elements of natural networks, 
suggesting the presence of interesting modular organizations || ||, [10], [11 . 



In this paper we point out that the usual clustering coefficient is in some 
cases unable to quantify the order underlying a network's structure. In par- 
ticular, a general modular structure is represented by a grid-like frame, such 
as a regular hypercubic lattice, that can be adequately quantified only by 
evaluating the frequency of rectangular loops appearing in the network. We 
introduce a grid coefficient that allows us to uncover the presence of a surpris- 
ing level of grid ordering in several real networks ranging from technological 
(the physical Internet) to social (scientific collaboration network) systems. 
By correlating the presence of grid-like structures with the local connectivity 
properties we are able to uncover the presence of a scaling hierarchy that 
appears to be a widely present organizing principle. In some cases, the scal- 
ing behavior of the grid clustering is very similar to the triangle clustering, 
suggesting a kind of statistical self-similarity in the modular construction of 
the network. 

A network (or graph in the mathematical language |[L2|| ) is a set of ver- 
tices and edges joining pairs of vertices, representing individuals and the 
interactions among them, respectively. Two features play a special role in 
the characterization of complex networks. The first one refers to the small- 
world concept ||: i.e. the small average distance in terms of number of edges 
between any two vertices in the system. The second consists in a very high 
heterogeneity, usually reflected in a scale-free distribution P(k) ~ fc~ 7 for 
the probability that any given vertex has degree k; i.e. k edges to other 
vertices [[/J. Both properties appear as ubiquitous in dynamically growing 
networks |1], |2|. Real networks also show a large degree of local clustering 
and correlations. A first quantitative measurements of these properties is 
provided by the clustering coefficient |J and the average nearest neighbors 
degree || |I0| . In particular, the clustering coefficient q of the vertex i, with 
degree ki, is defined as the ratio between the number of edges in the sub- 
graph identified by its nearest neighbors and its maximum possible value, 
kiiki — l)/2, corresponding to a complete sub-graph, i.e. Cj = Itijkiiki — 1). 
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The average clustering coefficient (c) is defined as the average value of Cj over 
all the vertices in the graph, (c) = ^ Cj/iV, where N is the size (total num- 
ber of vertices) of the network. This magnitude quantifies the tendency that 
two vertices connected to the same vertex are also connected to each other; 
therefore it measures the ordering in the system. By comparison, random 
graphs Jl3| are not clustered, having (c) = (k)/N, where (k) is the average 
degree, while regular lattices tend to be highly clustered with their neighbors. 
Further information can be extracted if one computes the average clustering 
coefficient c(k) function of the vertex degree k 101. 

In the physics terminology, the study of the clustering coefficient c(k) is 
strictly related to the analysis of three-point correlation functions The 
absolute average value — as well as the scaling with k — of this quantity are 
fundamental to discriminate the level of randomness and the organizing prin- 
ciples related to the basic hierarchies present in the networks. For instance, 
a large class of scale-free networks shows a clustering coefficient decaying 



as a power-law as a function of the vertex's degree [JTTj] . This implies that 
low degree vertices tend to form connected cliques with other vertices, while 
large connected vertices (hubs) tend to act as bridges between unconnected 
cliques, thus showing a small clustering coefficient. This fact highlights the 
existence of some modular building, identified by the cliques of small degree 
vertices UTTI . 



With the aim of unveiling the hidden ordering in complex networks the use 
of the two- and three-point correlations is however not always sufficient. As a 
very simple example we can consider a rectangular lattice or grid, Fig. |l|(a). 
In this case it easy to recognize that the clustering coefficient is not able 
to distinguish any ordering in a grid-like structure, since its value is always 
null. However, it is a good measure of order for other regular structures, 
such as a triangular lattice, Fig. |l](b). Since grid-like structures are among 
the preferred ordered patterns in natural systems, we introduce as a fur- 
ther quantitative characterization of networks' regularity some metrics that 
naturally account for rectangular symmetries |15|, [16|, [H] . We start by con- 
sidering the closed paths in a network in which all edges and vertices are 
distinct. These closed paths are known as cycles |12|]. Cycles of length 3 
(i.e. composed by three vertices) are called triangles. The ratio between the 
number of triangles that include the vertex i, e«, and its maximum possible 
number, kiiki — l)/2, defines the triangle coefficient of the vertex i, which 
is by definition equal to its clustering coefficient Cj. Cycles of length 4 are 
called quadrilaterals. In the spirit of the clustering coefficient, we want to 
improve the measurement of the network structure by using the grid coef- 
ficient, c^, that is defined as the fraction of all the quadrilaterals passing 
by the vertex i, Q iy divided by the maximum possible number of quadri- 
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laterals sharing the vertex i, Zi. Interestingly, the grid coefficient can be 
further decomposed by noting that each quadrilateral passing by % is com- 
posed by the vertex i itself plus three external vertices. Quadrilaterals can 
be therefore classified according to the nature of the external vertices, see 
Fig. [| If all the external vertices are nearest neighbors of i, they form 
a primary quadrilateral; on the other hand, if one of the external vertices 
is a second neighbor of i, the cycle they form is a secondary quadrilateral. 
If the vertex i has degree fcj and it is connected to fc" n second neighbors, 
it is easy to check that the maximum number of primary quadrilaterals is 
Zf = 3xg) = k i (k l -l)(k i -2)/2, while the maximum number of secondary 
quadrilaterals is Zf = k™ n ki(ki — l)/2. In this way, in order to study the grid 
properties of a network, we can define three magnitudes: the primary grid 
coefficient, c p 4i = Q?/Zf, the secondary grid coefficient c s Ai = Qf/Zf, and 
the total grid coefficient c 4j j = (Q? + Qf)/(Zf + Zf), where Q? and Qf are 
the actual number of primary and secondary quadrilaterals passing by the 
node i, respectively. The respective average grid coefficients are defined by 
averaging these quantities over all vertices in the network. 

As an example of this definition, let us consider the rectangular lattice 
represented in Fig. |l](a), in which each vertex % has 4 nearest neighbors and 8 
second neighbors. There are no primary quadrilaterals passing by any node 
i, while the number of secondary quadrilaterals is Q s = 4. From here we 
obtain (c^) = 0, (cf) = 1/9, and (04) = 1/15. On the other hand, in the 
triangular lattice, Fig. |I|(b), in which each vertex has 6 nearest neighbors 
and 12 second neighbors, we find 6 primary quadrilaterals and 6 secondary 
quadrilaterals, which yield (c^) = 1/10, (c|) = 1/30, and (c 4 ) = 1/20. Thus, 
regular grids exhibit a finite grid coefficient, in opposition to the clustering 
coefficient, which is zero for any hypercubic lattice. 

A very different case is represented by the Erdos-Renyi random graph 



13| , pL8|, |19] , constructed from a set on N vertices that are joined in pairs by 
an edge with probability p. In this case the emerging network has average 
degree (k) = pN and a Poisson degree distribution, and it is completely 
random, so that any ordering is absent in the infinite size limit. It is possible 
to calculate easily the grid coefficients for this network. For any vertex i, we 
need at least three nearest neighbors to construct a primary quadrilateral. 
Given this configuration, the probability to close the cycle in any of the 
three possible quadrilaterals is given by the probability p 2 to draw two edges 
between two of the three nearest neighbors. Therefore we have that for 
any vertex c p Ai = p 2 . This implies that an Erdos-Renyi random graph of 
N vertices and given average connectivity (k) has an average primary grid 
coefficient (c|) = ({k)/N) 2 ~ iV~ 2 . The calculation for the secondary grid 
coefficient is slightly more involved. In this case, for any vertex i, we need 
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at least two nearest neighbors and a second neighbor. This last vertex, 
being a second neighbor, is connected to at least one nearest neighbor, but 
not necessarily to any of the two nearest neighbors that will compose the 
quadrilateral. If the second neighbor is not a priori connected to the two 
nearest neighbors, then the probability of finding a quadrilateral is of order 
p 2 . On the other hand, if it is a priori connected to one of the selected 
nearest neighbors, the probability of closing a quadrilateral is just p; i.e. 
the probability of drawing an edge between the second neighbor and the 
remaining nearest neighbor. This last instance (that the second neighbors is 
a priori connected to one of the nearest neighbors considered) happens with 
probability 1/ki, where ki is the degree of the vertex i. Therefore, at leading 
order in p, we have that for any vertex c\ i = p/ki. The average secondary 
grid coefficient is then given by (c^) = ^ k>2 P{k)p/k = {1/k)' (k)/N, where 
P{k) is the degree distribution of the Erdos-Renyi random graph fI5|| . By 
summing both contributions we have that at leading order, the average total 
grid coefficient is scaling as (04) ~ p. In general thus the grid coefficient for 
random graphs of given average connectivity scales as (04) ~ iV -1 with the 
number of vertices N. 

In order to characterize the level of grid-like ordering in real networks, we 
have measured the grid coefficients in four different systems, characterized by 
a scale-free degree distribution, which have been the focus of several recent 
studies: 

Internet: Internet map at the Autonomous System (AS) level, as of 22nd 
November 1999 || ^, |20|- These maps are collected and made publicly avail- 
able by the National Laboratory for Applied Network Research^. Each AS 
refers to one single administrative domain of the Internet. Different ASs are 
in most cases connected through a Border Gateway Protocol (BGP) that 
identifies any AS through a 16-bit number. The map considered is composed 
by 6243 ASs acting as vertices and by 12113 BGP peer connections, acting 
as edges, yielding an average degree (k) = 3.88. 

World- Wide- Web: Map of the World- Wide- Web collected at the domain 
of Notre Dame University^ |21], |22], E3J. This network is actually directed, 



but we have considered it as non-directed. The map is composed by 325729 
web pages, represented by vertices, and 1090108 hyperlinks pointing from 
one page to another, represented by edges, which corresponds to an average 
degree (k) = 6.69. 

*The National Laboratory for Applied Network Research (NLANR), sponsored by the 
National Science Foundation, provides Internet routing related information based on Bor- 
der Gateway Protocol data (see http://moat.nlanr.net/). 

^Data publicly available at http://www.nd. edu/^networks. 
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Movie actor collaborations: Network of co-actorship obtained from the 
Internet Movie Database^ §, [7], ||, |5| . In this case the 82583 actors represent 



the vertices of the graph. An edge is drawn between two actors if they have 
played together in at least one movie. The total number of edges is 3666738, 
with an average degree (k) = 88.80. 

Scientific collaborations: Network of scientific collaborations collected 
from the condensed matter preprint database at Los Alamos^ |27], |28 



This web site hosts the largest collection of preprints in condensed matter 
physics. The graph is composed by 16264 different authors, that are con- 
nected by one edge if they have coauthored a joint paper. The total amount 
of collaborations (edges) is then 47594, yielding an average degree (k) = 5.85. 
One can assign a weight to such links, by counting how many joint paper 
are present in the repositories. In this first approach we only considered the 
topological connections given by the unweighted links. 

In Table [1] we report the different average grid coefficients for all the 
networks analyzed, compared with those corresponding to the Erdos-Renyi 
random graph, the rectangular grid and the triangular lattice. It is interesting 
to note that the average grid coefficients of all networks are two to four 
orders of magnitude larger than the corresponding coefficients of a random 
graph with the same average degree and size N. While the small-world 
property and the scale-free degree distribution common to all these networks 
are generally associated to randomness and large fluctuations, the presence of 
large grid coefficient makes those graphs reminiscent of a grid-like ordering, 
very probably due to the presence of hierarchical structures and well-defined 
communities. 

More information can be gathered by studying the grid coefficient as a 
function of the vertex's degree k (i.e. by considering the average value c^k) 
of the total grid coefficient for all the vertices with the same degree k). As 
similarly noticed for the clustering coefficient [§, [TTJ] , the grid coefficient is 
well approximated in most cases by a power-law decay for increasing k. This 
feature indicates a correlation between the vertices' degree and the local 
network structure. In particular, low degree vertices are arranged in fairly 
ordered patterns whose building blocks are triangular and rectangular struc- 
tures. Vertices with large degree act as the network backbone by connecting 
the highly clustered regions. Since we are facing power-law behavior for the 
clustering and grid coefficient, we have that no characteristic length scales are 



^The source of the data is the Internet Movie Database at http://www.imdb.com. 
A collection of edges where the actors have been numbered is available at 
http://www.nd.edu/~networks. 

3 Los Alamos National Laboratory (LANL) preprint database located at 



http: //xxx. lanl.gov/archive/cond-mat 
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present in the system and thus there is a continuum hierarchy of structures. 
It is also worth remarking that the grid coefficient is always smaller than the 
clustering coefficient because it represents a measure of longer range corre- 
lations. Even though statistical fluctuations are comparable, in some cases 
the grid coefficient appears to be less susceptible to noise than other metrics. 
Finally, we note the presence of two classes of networks: the first with a scal- 
ing of the c±{k) very similar to c(k) (such as the technological Internet and 
World- Wide- Web networks), and a second one with c^{k) different from c(k) 
(as is the case of the social networks). When the power- law behavior is alike, 
we can talk of self-similar networks in which both rectangular and triangular 
patterns are equally implemented in the modular construction of the net- 
work. In the second situation, one of the two patterns is abandoned earlier 
in the hierarchical construction of the graph, breaking the self-similarity at 
all levels of the hierarchy. 
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Figure 1: (a) Regular square lattice. Nearest neighbors of a vertex (empty 
circles) are not neighbors of each other. Therefore the clustering coefficient 
q = for every vertex i. (b) Triangular lattice. Here some of the neighbors 
are connected to each other. In particular 2 out of every 5 possible edges are 
drawn; hence q = 2/5 for all the vertices 



a) b) 




Figure 2: (a) Example of a primary quadrilateral, in which the three exter- 
nal vertices are nearest neighbors of the vertex i. (b) Example of a secondary 
quadrilateral in which one of the external vertices (empty square) is a second 
neighbor of the vertex i . 
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Figure 3: Clustering coefficient c(k) (hollow symbols) and grid coefficient 
04(h) (filled symbols) as a function of the node degree, for the networks 
considered, (a) Internet at the AS level, (b) Map of the World-Wide- 
Web domain collected at www.nd.edu. (c) Network of co-actorship from the 
Internet Movie Database, (d) Scientific collaborations from the cond-mat 
preprint database. 
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(k) 


(c) 


<<5> 


(ct) 


(C4> 


Internet 


3.88 


0.25 


0.043 


0.028 


0.028 


World-Wide-Web 


6.69 


0.23 


0.14 


0.088 


0.090 


movie actor collaborations 


88.80 


0.75 


0.66 


0.009 


0.027 


scientific coauthorship 


5.85 


0.64 


0.40 


0.036 


0.12 


Erdos-Renyi random graph 


Np 


~ N- 1 


-AT" 2 


~ N- 1 


~ A^ _1 


square lattice 


4 








1/12 


1/15 


triangular lattice 


6 


2/5 


1/10 


1/30 


1/20 



Table 1: Average degree, clustering coefficient, and primary, secondary, and 
total grid coefficients for the different networks considered (see text). 
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